RESOURCE/METHODOLOGY 



Mutually exclusive signaling signatures 
define the hepatic and pancreatic 
progenitor cell lineage divergence 

Elisa Rodriguez-Seguel, 1 Nancy Mah, 2 Heike Naumann, 1 Igor M. Pongrac, 1 Nuria Cerda-Esteban, 1 
Jean-Fred Fontaine, 2 Yongbo Wang, 3 Wei Chen, 3 Miguel A. Andrade-Navarro, 2 
and Francesca M. Spagnoli 1,4 

laboratory of Molecular and Cellular Basis of Embryonic Development, Computational Biology and Data Mining, laboratory 
for Novel Sequencing Technology, Functional and Medical Genomics, Max Delbriick Center for Molecular Medicine, D- 13092 
Berlin, Germany 



Understanding how distinct cell types arise from multipotent progenitor cells is a major quest in stem cell biology. 
The liver and pancreas share many aspects of their early development and possibly originate from a common 
progenitor. However, how liver and pancreas cells diverge from a common endoderm progenitor population and 
adopt specific fates remains elusive. Using RNA sequencing (RNA-seq), we defined the molecular identity of liver 
and pancreas progenitors that were isolated from the mouse embryo at two time points, spanning the period when 
the lineage decision is made. The integration of temporal and spatial gene expression profiles unveiled mutually 
exclusive signaling signatures in hepatic and pancreatic progenitors. Importantly, we identified the noncanonical 
Wnt pathway as a potential developmental regulator of this fate decision and capable of inducing the pancreas 
program in endoderm and liver cells. Our study offers an unprecedented view of gene expression programs in liver 
and pancreas progenitors and forms the basis for formulating lineage-reprogramming strategies to convert adult 
hepatic cells into pancreatic cells. 
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The liver and pancreas share many common aspects of 
their early embryonic development as well as some adult 
functional properties and both are endowed with essen- 
tial metabolic functions (Slack 2007; Zaret 2008). This 
close relationship makes the liver a very attractive tissue 
source for generating new pancreatic (3 cells by lineage 
reprogramming, which might be used in the context of 
cell-based therapy of diabetes. However, how liver and 
pancreas cells diverge from a common endoderm progen- 
itor population and adopt specific fates remains elusive. 

During embryonic development, the pancreas orig- 
inates from distinct outgrowths of the dorsal and ventral 
regions of the foregut endoderm (Spagnoli 2007; Puri and 
Hebrok 2010). Subsequently, the two buds fuse to form 
a single organ containing both pancreatic exocrine and 
endocrine cells (Spagnoli 2007; Puri and Hebrok 2010). 
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The liver originates solely from the ventral foregut 
endoderm, adjacent to where the ventral pancreas (vpa) 
emerges, and the two cell fates are specified concomi- 
tantly by mouse embryonic day 8.5 (E8.5) (Deutsch et al. 
2001). Previous studies showed that the ventral foregut 
endoderm is multipotent for the hepatic and pancreatic 
programs (Deutsch et al. 2001; Tremblay and Zaret 2005; 
Miki et al. 2012). Hepatic and pancreatic endoderm ex- 
press a common set of transcription factors, such as the 
FoxA and GATA factors, and are exposed to the same 
extrinsic signals: fibroblast growth factor (FGF) and BMP 
(Deutsch et al. 2001; Chung et al. 2008; Spagnoli and 
Brivanlou 2008; Zaret 2008). However, the develop- 
mental regulators of the fate choice between liver and 
pancreas are poorly understood. It is also unknown how 
the pancreatic program is stably repressed in nascent 
hepatic progenitors and how the hepatic program is re- 
pressed in pancreatic progenitors. This knowledge will 
aid in not only the programming of stem cells to pancre- 
atic and hepatic lineages, but also the discovery of the 
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mechanisms underlying cellular plasticity between liver 
and pancreas. 

Studies conducted on various stem cell types or whole 
organisms (Tumbar et al. 2004; Wang et al. 2004; Dequeant 
et al. 2006; Mitiku and Baker 2007) have clearly demon- 
strated that significant insights into the regulation of the 
cell fate decision are obtained through global analysis of 
gene expression events. Thus, to search for developmen- 
tal regulators that are involved in the control of the 
pancreas and liver cell fate decision, it is fundamental 
to have a global picture of the regulation of gene expression 
in the mammalian hepatic-pancreatic en do derm lineage. 
To this aim, we performed sequencing-based expression 
profiling (RNA sequencing [RNA-seq]) on hepatic and 
pancreatic progenitors in mice at distinct developmental 
stages. We devised a strategy to purify liver and pancreas 
progenitor populations directly from the endoderm of 
Tg(Proxl-EGFP [enhanced green fluorescent protein]) 
reporter mouse embryos before and after the onset of 
organogenesis. By integrating the temporal and spatial 
gene expression profiles, we found mutually exclusive 
signaling signatures in hepatic and pancreatic progeni- 
tors. Importantly, we identified the noncanonical Wnt 
pathway as a potential developmental regulator of the 
pancreas versus liver fate decision, since it is expressed in 
the foregut endoderm before the cell fate choice is made 
and then is maintained in pancreas progenitors but is 
absent in hepatic progenitors. Moreover, when assayed in 
Xenopus embryos and mammalian cells, activation of the 
noncanonical Wnt pathway is able to promote pancreatic 
fate, suggesting an ancient mechanism for controlling the 
pancreas versus liver fate choice. Overall, our study offers 
an unprecedented view of gene expression programs in 
liver and pancreas progenitors during the defined period of 
their lineage divergence and provides novel insights into 
key mechanisms that may underpin cellular plasticity and 
reprogramming between the two cell types. 

Results 

RNA-seq on FACS-purified hepatic and pancreatic 
progenitors 

To explore the transcription program associated with 
liver and pancreas progenitors in vivo at the time of their 
fate divergence, we performed RNA-seq on distinct 
mouse endoderm progenitor populations isolated at two 
developmental stages. For in vivo monitoring of hepatic 
and pancreatic progenitor cells, we used the transgenic 
mouse line Tg(Proxl-EGFP) Gsat/Mmcd that carries the re- 
porter gene EGFP into the Proxl homeobox gene locus 
(Gong et al. 2003). Proxl is the earliest specific marker in 
common between hepatic and pancreatic endoderm from 
gastrulation onward (Burke and Oliver 2002; Wandzioch 
and Zaret 2009) and therefore is ideally suited for iso- 
lating both hepatic and pancreatic progenitors (Fig. 1). 
We showed that Proxl -EGFP transgenic mouse embryos 
reproduced the endogenous pattern of Proxl expression 
in the endoderm from E7.5 onward (Fig. 1A,B ; Burke and 
Oliver 2002; data not shown). In both ventral and dorsal 



foregut endoderm of Tg( Proxl -EGFP) embryos, the local- 
ization of EGFP perfectly mirrored endogenous Proxl 
expression and overlapped with other endodermal genes, 
such as Foxa2, at E8.5 and with tissue-specific genes, such 
as Pdxl in the pancreas or Liv2 in the liver, from E9.5 
onward (Fig. 1A,B ; Watanabe et al. 2002; Lee et al. 2005; 
Wandzioch and Zaret 2009; Puri and Hebrok 2010). Thus, 
the Tg( Proxl -EGFP) in vivo model enabled us to visualize 
hepatic and pancreatic progenitors under fluorescence 
microscopy before organogenesis had started. Distinct 
regions of the prospective hepatic and pancreatic endo- 
derm were manually microdissected, and Proxl -EGFP + 
cells were FACS-purified and subjected to RNA-seq 
analysis to define their transcriptomes (Fig. 1C-E). Pre- 
vious fate mapping studies in mouse and chick embryos 
suggested differential locations of hepatic and pancreatic 
progenitors in the ventral foregut at the early somite stage 
(Tremblay and Zaret 2005; Miki et al. 2012). To determine 
whether regional identity within the foregut is associated 
with differential gene expression, we collected Proxl - 
EGFP + cells of the whole ventral foregut (referred to as fg) 
and exclusively of the medial ventral foregut (referred to 
as mfg) from E8.5 embryos at the same somite stage 
(seven to nine somites) (Fig. 1C, top). At E10.5, budding 
sites of the liver and pancreas, including both the dorsal 
pancreas (dpa) and vpa, were readily distinguishable, and 
the three distinct progenitor populations were isolated 
and processed for the analysis (Fig. 1C, bottom). 

Given the small number of progenitor cells, we applied 
a submicrogram RNA-seq method and used paired-end 
sequencing technology (Adamidi et al. 2011). A compa- 
rable number of high-quality raw reads was obtained from 
each sample and used to estimate the relative abundance 
of transcripts (Table 1; Trapnell et al. 2010). Principal 
component analysis (PC A) on the nonredundant gene 
expression data showed that individual samples perfectly 
clustered according to their embryonic stage and/or re- 
gion of origin (Fig. 2A). Known hepatic and pancreatic 
tissue-specific transcripts were found in the El 0.5 liver 
and pancreas RNA-seq profiles, respectively, reflecting 
the high reliability and accuracy of the data obtained 
(Fig. IE; Supplemental Fig. 1). 

Integrating temporal and spatial transcription profiles 
of the progenitor populations 

To identify developmental regulators of pancreatic and/or 
hepatic fate, we integrated the temporal and spatial 
analyses. Both hepatic and pancreatic progenitors arise 
from foregut endoderm cells (Spagnoli 2007; Zaret 2008). 
Therefore, the set of genes expressed in the foregut (fg 
or mfg) is expected to include regulators of both early 
pancreatic and hepatic fate specification as well as the 
choice between the two fates (liver and pancreas). We 
compared those genes with the genes expressed later in 
the El 0.5 liver, vpa, and dpa and found a number of genes 
specific to either liver or pancreas that were already 
expressed at the earlier time point in the E8.5 foregut 
(89 and 517 genes, respectively) (Fig. 2B, left). Of interest, 
these numbers suggest a higher divergence between the 
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Figure 1. In vivo isolation and RNA-seq profiling of endoderm progenitor cells from Tg(Proxl-EGFP) mouse embryos. [A) Rep- 
resentative maximum confocal Z-projections of immunofluorescence analysis in Tg(Proxl-EGFP) E8.5 mouse embryos shows EGFP 
reporter expression in both ventral (arrows) and dorsal (asterisk) foregut cells, mirroring endogenous Proxl expression (see the inset 
for overlay) and overlapping with Foxa2 expression domains at the three-somite (3S) stage. Embryos are presented in ventral view. 
[B) In Tg(Proxl-EGFP) E9.5 mouse embryos, EGFP expression was detected in the hepatic and both dorsal and ventral pancreatic 
buds, mirroring the endogenous Proxl (see the inset for overlay). EGFP colocalized with Pdxl in both pancreatic buds and with Liv2 
solely in the liver bud. Embryos are presented in lateral view. (C-E) Schematic representation of cell sampling and the RNA-seq 
procedure. (C) EGFP + fg and mfg endoderm at the seven- to nine-somite (7-9S) stage/E8.5 and liver (lv), ventral (vpa), and dorsal 
(dpa) pancreas at E10.5 were microdissected from Tg(Proxl-EGFP) embryos. Cells were dissociated and subjected to FACS. 
(D) Representative diagram of the EGFP + cell fraction isolated by FACS. The dashed box indicates EGFP + -gated cells, and cells 
negative for EGFP are in purple. Transcript expression was profiled by RNA-seq. (£) Example illustrating the RNA-seq read coverage 
profile of the pancreatic-specific gene Pdxl. The Y-axis indicates the number of read counts in each cell population (Y-axis scale is 
0 1000 counts). FPKM (fragments per kilobase of exon per million fragments mapped) values for Pdxl in each data set are included on 
the right. Exons are depicted as gray boxes at the bottom on the X-axis. As expected, a large number of reads was found in pancreatic 
progenitors. Bars: A, 100 |xm; B, 50 fxm. 



foregut and liver when compared with the number of 
genes whose expression is instead maintained in the 
pancreas (vpa and dpa) (Fig. 2B, left; Supplemental Fig. 
2). This is in line with the hypothesis of the pancreas 
being the default fate of the ventral foregut endoderm 
(Deutsch et al. 2001). In contrast, genes initially showing 
low expression in the foregut but high expression later in 
El 0.5 hepatic or pancreatic progenitors represent cell 
type-specific markers of differentiation and possibly fac- 
tors involved in late aspects of organogenesis (Fig. 2B, 
right; Supplemental Fig. 3). At this later time point, the 
number of genes that are unique to either the dpa or vpa 



(211 vs. 257) (Fig. 2B, right) indicates a large divergence 
between the two buds, reflecting the dual embryological 
origin of the pancreas. 

Different Wnt signaling signatures correlate 
with pancreatic or hepatic progenitor states 

To study the mechanisms underlying the pancreas versus 
liver fate decision, we focused on the transcripts whose 
expression was detected in the E8.5 foregut and main- 
tained exclusively in the pancreatic rudiments and there- 
fore possibly was endowed with a role in this cell fate 
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Table 1. Summary of sequencing data and annotation 
information 



Number of transcripts' 



Sample 


Raw reads 


Mapped reads 


Annotated 


Unannotated 


fg 


121,497,737 


115,767,182 


38,159 


19,348 


mfg 


114,786,211 


108,295,227 


38,254 


19,347 


Liver 


114,436,606 


109,182,827 


38,260 


19,363 


vpa 


109,534,481 


102,391,351 


38,171 


19,371 


dpa 


104,526,851 


99,464,506 


38,168 


19,356 



The transcript abundance was calculated by estimating the frag- 
ments per kilobase of exon per million mapped fragments (FPKM). 
An average of 38,000 annotated distinct transcripts per sample was 
detected. 



a This includes all Cufflinks genes found in each sample before 
filtering. Annotated transcripts were considered to be all Cufflinks 
genes that were associated to a gene symbol. 

restriction. We hereafter refer to this cluster as FP (foregut 
pancreas) (Fig. 2B, left). A gene ontology (GO) term analysis 
of our results revealed significant enrichment for Wnt- 
related "Molecular Function" categories in the group FP, 
such as Wnt protein-binding (GO:0017147; seven genes,- 
P-value < 0.00001.9) and Wnt-activated receptor (GO: 
0042813; five genes,- P-value < 0.00035) activities. Among 
the Wnt-associated factors were genes such as Frizzled 2 
[Fzd2), Fzd4, Ror2, SfrpS, Sfrp2, and Wis (Supplemental 
Table 1). Also, Wnt-related "Biological Process" categories 
were primarily enriched in the group FP, such as estab- 
lishment of epithelial cell polarity (GO: 0090162), Wnt 
receptor signaling pathway (GO: 0007223), and estab- 
lishment of planar polarity (GO: 0001736) (Supplemental 
Table 2). Of interest, no other regions of the Venn diagram 
displayed such a significant enrichment for Wnt-related 
categories as the group FP, suggesting a unique signaling 
signature of the pancreas versus liver fate in the mam- 
malian endoderm. 

We decided to further characterize the expression dy- 
namics of Wnt signal transducers. Interestingly, in our 
data set, we found that intracellular transducers of the 
canonical Wnt/p-catenin signaling (Grigoryan et al. 2008; 
Clevers and Nusse 2012) — such as fl-catenin (Ctnnbl), 
APC, Axin, and TCPs — as well as the common canonical 
and noncanonical component Dishevelled 1 [Dvll ) (Angers 
and Moon 2009; Kikuchi et al. 2009) were expressed at 
stable levels in the different progenitor populations 
(Fig. 2C), whereas transcripts encoding ligands, receptors, 
and coreceptors exhibited lineage-specific induction or 
repression (Fig. 2D). In particular, only noncanonical Wnt 
ligands (Wnt5a, Wnt5b, and Wnt7b) (Kikuchi et al. 2009; 
Grumolato et al. 2010) were found in the foregut and 
pancreas and all of them were strongly down-regulated in 
liver progenitors (Supplemental Fig. 4). Additional de- 
terminants of noncanonical/planar cell polarity (PCP) 
Wnt specificity displayed similar expression profiles, in- 
cluding the receptors Fzd2 and Fzd7 and the PCP core 
membrane proteins Stbm/Vangl2, Celsr2, and Fatl (Fig. 2D). 
Finally, the coreceptors Lrp5 and Ror2 showed divergent 
expressions — one strongly induced in the liver, and the 
other one induced exclusively in pancreas progenitors 



(Fig. 2D). These results suggest a cell type-dependent non- 
canonical Wnt activation in the foregut and pancreatic 
progenitors that is ensured by not only distinct classes of 
ligands, but also recruitment of unrelated coreceptors, as 
previously described in other contexts (Grumolato et al. 
2010). For instance, the differential expression of Ror2 
coreceptor might sustain noncanonical Wnt activation in 
foregut and pancreas progenitors but not in liver progenitors. 

Noncanonical Wnt signaling signature underlies 
the pancreas versus liver fate decision 

If multiple components of a pathway are expressed and 
the expression is developmentally regulated, it is very 
likely that the pathway is active. Therefore, we under- 
took several approaches to study the possible role of the 
Wnt noncanonical pathway in the process of pancreas 
versus liver cell fate specification. For further analysis, 
we prioritized FP group transcripts that showed signifi- 
cant differential regulation between the pancreatic and 
hepatic endoderm (150 transcripts out of 517; Cufflinks, 
P-value < 0.05) (Figs. 2D, 3A). We performed quantitative 
RT-qPCR and immunofluorescence staining to confirm 
that the transcripts identified as differentially expressed 
by RNA-seq indeed showed significant enrichment in the 
pancreatic cell lineage (Fig. 3B,C ; Supplemental Fig. 5). In 
Drosophila as well as some mammalian tissues, the PCP 
proteins are initially enriched in the apical cell mem- 
brane prior to their asymmetric distribution at either the 
proximal or distal side of the cell (Vichas and Zallen 201 1; 
Wallingford 2012). Their distribution within the plane of 
the foregut epithelium has not been previously reported. 
We found Fzd2 receptor, Ror2 coreceptor, and PCP pro- 
teins such as Celsr2 and Fatl to be enriched at the cell 
surface of both foregut epithelial and pancreatic progen- 
itor cells with no obvious asymmetric distribution but 
completely absent in hepatic progenitors (Fig. 3Q Sup- 
plemental Fig. 5A). 

In several systems, there is direct evidence that non- 
canonical Wnt signaling regulates cell adhesion and cy- 
toskeleton components (Karner et al. 2009). Accordingly, 
in our data set, several adhesion molecules — including 
Claudin 4, laminin, integrin a3 (Itga3), fibronectin leu- 
cine-rich transmembrane protein 2 (Flrt2), and Flrt3 — as 
well as cytoskeleton components such as cytokeratins 
CK19 and CK7 and Shroom3 showed similar differential 
expression, and were down-regulated in hepatocyte pro- 
genitors after lineage divergence (Fig. 3B,C; Supplemental 
Fig. 5B). Taken together, these results indicate that active 
remodeling of the foregut epithelium accompanies cell 
fate determination between liver and pancreas, resulting 
in differences in the relative adhesiveness of the cells. 
Moreover, our findings suggest that the noncanonical 
Wnt signaling might influence this cell fate decision 
within the foregut endoderm. 

One prediction from this hypothesis is that upon ex- 
posure to noncanonical Wnt, endodermal cells acquire 
a pancreatic-specific differentiation program and fail to 
induce hepatic genes expression. To test this prediction, 
we first used one of the most intensively studied 
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Figure 2. Temporal and spatial integration analysis of the RNA-seq-derived transcriptome profiles. [A) PCA shows that the RNA-seq- 
derived transcriptome profiles are characteristic of different progenitor cell types (for detailed description, see the Materials and 
Methods). [B) Venn diagrams showing the number of unique and common highly expressed transcripts between progenitor cells at 
different developmental stages. To further focus our analysis on subsets of genes with distinct expression patterns, we divided the 
working data set into two groups based on a cutoff for high expression (defined as FPKM = 10, at approximately the 50th percentile of 
RNA-seq expression for each sample). Of 14,053 genes, 8110 could be categorized in the defined Venn regions. For example, 5437 genes 
exhibited relative abundance values of >10 FPKM in all samples, while 517 genes were highly expressed in the foregut, vpa, and dpa but 
not in the liver (referred to as group FP). In contrast, 89 genes were highly expressed in the foregut and liver but not in the vpa and dpa. 
Three-hundred-sixty-three transcripts were present (>10 FPKM) only in the fg but not in the liver, dpa, and vpa (not shown in the 
diagram). As shown in the PCA plot in A, fg and mfg were highly similar,- therefore, these samples were combined together as "foregut " 
to simplify the visualization. (C,D) Levels of Wnt signaling pathway gene expression across the fg, liver (lv), vpa, and dpa progenitors. 
FPKM values (Y-axis) were plotted against the different progenitors cell types (X-axis). (*) Wnt factors present in the 150 FP group 
transcripts that showed significant differential regulation between the pancreas and liver. 



noncanonical Wnts, Wnt5a, which we also found to be 
cell-autonomously produced by the E8.5 foregut en do derm 
(Supplemental Table 1; Supplemental Fig. 4A-C). Sub- 
sequently, at El 0.5, Wnt5a expression was maintained at 
lower levels in pancreatic progenitor cells and was absent 
in hepatoblasts (Supplemental Fig. 4B,C). In addition to the 
endodermal expression, we found Wnt5a to be abundant in 
the surrounding mesenchyme at both E8.5 and El 0.5, 
suggesting potential autocrine and/or paracrine ligand 
activities (Supplemental Fig. 4C). The other noncanonical 
Wnts, Wnt5b and Wnt7b, were found to be abundant in 
pancreatic progenitor cells and not in hepatoblasts starting 
from El 0.5, while they were expressed at low levels in 
foregut cells (Supplemental Fig. 4A,B). 

Perturbations of Wnt5a signaling in Xenopus, zebra- 
fish, and mice strongly suggest that Wnt5a activates a 
conserved pathway that controls cell fate, movements, 
and polarity during development (Moon et al. 1993; Ho 



et al. 2012). Moreover, the expression pattern appears 
conserved across species because WntSa is expressed in 
endodermal as well as surrounding mesodermal cells in 
the Xenopus embryo at the time of foregut organogenesis 
(Supplemental Fig. 4D ; Zhang et al. 2013). Therefore, we 
first used the Xenopus laevis embryo as a model system 
and exposed anterior endodermal cells, which are fated 
to become either liver or pancreas, to soluble Wnt5a pro- 
tein. In addition to the expected morphogenetic defects, 
Wnt5a induced pancreatic progenitor gene expression, 
including Pdxl and Ptfla, and, to a lesser extent, pancre- 
atic differentiation markers, such as Glucagon (Gcg) and 
Trypsin (Fig. 4A,B). Concomitantly, Wnt5a treatment re- 
pressed expression of the hepatic genes Hex, Fori, and 
Albumin in the same endodermal cells without affecting 
other endodermal cell fates, as judged by unchanged 
Nkx6.2 (duodenum) and Sox2 (stomach) expression levels 
(Fig. 4B ; Chalmers et al. 2000; Dichmann and Harland 
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Figure 3. Analysis of candidate regulators of the pancreatic versus hepatic fate decision. [A) Heat map view of the FP group transcripts 
that were differentially expressed between any of two samples (150 transcripts out of 517; Cufflinks, P-value < 0.05). Colors represent 
high (red) or low (blue) expression values based on Z-score normalized FPKM values for each gene. White represents the average 
between red (high) and blue (low) expression values. Dashed boxes highlight gene sets validated by either RT-qPCR or immunoflu- 
orescence analyses. [B] RT-qPCR validation of a subset of differentially expressed genes of the FP group. Data were normalized to that of 
succinate dehydrogenase [SDHA] and are represented as fold change compared with the E8.5 foregut sample (set to 1 as calibrator). 
Error bars represent ±SEM. (C) Immunofluorescence analysis validated the exclusive localization of Celrs2, Claudin 4 (Cldn4), CK19, 
Fatl, Fzd2, and Ror2 in E8.5 foregut endoderm (see arrows in the insets) and pancreatic progenitors and their absence in the liver (see 
arrows). Micrographs show cross-sections of E8.5 and E10.5 mouse embryos. Bars, 50 fxm. (duo) Duodenum,- (hep) hepatic progenitors,- 
(lv) liver,- (nf) neural folds. 
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Figure 4. Noncanonical Wnt5a activity promotes pancreatic versus hepatic fate in the anterior endoderm. (A) Xenopus embryos 
injected with Wnt5a mRNA showed a shortened and mildly bent body at the tailbud stage, as previously described (Kim et al. 2005). (B) 
Endodermal explants were cultured in the presence of Wnt5a recombinant protein from stage 10, collected at the tadpole stage, and 
assayed for expression of the indicated pancreatic, hepatic, and duodenum/stomach genes by RT-qPCR analysis. Untreated anterior 
endodermal explants were used as control. Data were normalized to that of ornithine decarboxylase {ODC) and are represented as fold 
changes compared with untreated endoderm sample (set to 1 as calibrator). Error bars represent ±SEM. (C) Whole-mount double in situ 
hybridization analysis of Hex (light blue) and Ptfia (purple) in control and Wzzt5<2-injected Xenopus embryos at the tadpole stage. The 
arrow indicates Hex expression in the liver bud, and arrowheads indicate Ptfia expression in the two pancreatic buds (dorsal and 
ventral buds). Dashed lines mark expanded Ptfia expression in the injected embryos. Total number of injected embryos = 61; 41% 
showed visible expansion of Ptfia. (AE) Anterior endoderm; (PE) posterior endoderm. [D) RT-qPCR analysis of endodermal explants 
treated with Wnt5b (200 ng/mL) recombinant protein. Data were normalized to that of ODC and are represented as fold changes 
compared with untreated endoderm sample (set to 1 as calibrator). (£) RT-qPCR analysis of endodermal explants treated with 500 ng/mL 
Wnt3a recombinant protein. Data were normalized to that of ODC and are represented as fold changes compared with untreated 
endoderm sample (set to 1 as calibrator). [F) RT-qPCR analysis of direct downstream target genes of the Wnt/p-catenin pathway in 
endodermal explants treated with 500 ng/mL Wnt5a or 500 ng/mL Wnt3a recombinant protein. Data were normalized to that of ODC and 
are represented as fold changes compared with untreated endoderm sample (set to 1 as calibrator). (G) TOPFLASH and ATF2-luc reporter 
assays in Xenopus embryos. Four-cell stage embryos were injected into the vegetal blastomeres with 50 pg of TOPFLASH or 100 pg of 
ATF2-luc plus 25 pg of Renilla luciferase reporter plasmids. Endodermal explants were dissected at stage 9 and either left untreated as 
control (CTRL) or exposed to 500 ng/mL Wnt5a or 500 ng/mL Wnt3a recombinant protein, as indicated. Luciferase reporter assays were 
carried out in explants lysed at gastrula and early tailbud stages. [H) Western blot analysis of dissected anterior endodermal explants either 
left untreated as control (CTRL) or exposed to Wnt5a or Wnt3a recombinant protein. The relative ABC/tubulin levels in the treated 
explants compared with the control, which was set to 1.0, are indicated. ((3-cat) Total (3-catenin,- (tub) a-tubulin. (*) P < 0.05; (**) P < 0.01, 
as determined by the REST program statistical analysis (Pfaffl et al. 2002). 



201 1). Similar effects were observed when Wnt5a mRNA 
was microinjected into either anterior or posterior endo- 
dermal cells of the Xenopus embryo, resulting in ex- 



panded Ptfia expression beyond its normal boundaries 
(Fig. 4C). Furthermore, we found that activation of the 
noncanonical Wnt pathway using a different noncanonical 
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ligand, the Wnt5b recombinant protein, also induced a 
strong expression of pancreatic progenitor transcription 
factors in anterior endodermal cells (Fig. 4D). Finally, we 
examined whether the exposure to the noncanonical Wnt 
pathway has an effect on cell proliferation in the anterior 
endoderm. Immunostaining of phospho-histone H3 (PHH3) 
to mark cells undergoing mitosis revealed no significant 
differences in proliferating cell number in the foregut 
between Wnt5a-injected and uninjected early tadpole stage 
embryos (Supplemental Fig. 4E). 

On the other hand, the establishment of proper levels 
of canonical Wnt/p-catenin signaling and its temporal 
sequential activation in the anterior endoderm (Fig. 4) 
are known to be essential for foregut identity and organ 
formation (Ober et al. 2006; Li et al. 2008; Angers and 
Moon 2009; Puri and Hebrok 2010; Zhang et al. 2013). 
After adding Wnt3a recombinant protein, a well-known 
canonical Wnt activator (Grigoryan et al. 2008; Clevers 
and Nusse 2012), to anterior endodermal cells, we ob- 
served that the levels of expression of hepatic and pancre- 
atic genes were unchanged when compared with control 
endoderm, ruling out an apparent choice of cell type 
between liver or pancreas (Fig. 4E). 

The noncanonical Wnt signaling is able to antagonize 
the canonical signaling in certain biological contexts 
(Grumolato et al. 2010; Ho et al. 2012). To distinguish 
between these two possible Wnt5a activities (e.g., non- 
canonical Wnt activity or antagonist effect on Wnt/ 
P-catenin) in the context of liver and pancreas lineage 
divergence, we performed a series of quantitative assays 
and biochemical analysis in the Xenopus endoderm. First, 
we analyzed the transcriptional responses to both path- 
ways in endoderm cells treated with Wnt5a by ( 1 ) exam- 
ining direct downstream target genes of Wnt/p-catenin 
pathway and (2) using specific lucif erase reporter assays 
based on TCF/LEF (TOPFLASH) and ATF2 response 
elements for canonical and noncanonical Wnt, respec- 
tively (Ohkawara and Niehrs 2011; Clevers and Nusse 
2012). RT-qPCR analysis showed that Wnt3a treatment 
increased the expression levels of direct Wnt target genes 
such as Cyclin-Dl (Ccdnl), Lef-1, Myc, and Axin 2, 
whereas Wnt5a treatment does not affect their expres- 
sion in anterior endodermal cells (Fig. 4F). This is in line 
with the RNA-seq profiles of mammalian pancreatic and 
hepatic progenitor cells displaying comparable expres- 
sion levels of these downstream target genes (Fig. 2; 
Supplemental Table 2). Furthermore, we found that 
exposure to Wnt5a does not suppress endogenous ca- 
nonical Wnt activity in the endoderm, as monitored by 
the Wnt/p-catenin TOPFLASH reporter assay, whereas 
it induces the noncanonical Wnt ATF2 reporter activity 
(Fig. 4G). Notably, endogenous canonical Wnt/p-catenin 
transcriptional activity accumulates in the Xenopus endo- 
derm from gastrulation onward (Fig. 4G; Angers and Moon 
2009; Ohkawara and Niehrs 201 1 ). This results in elevated 
basal levels of TOPFLASH luciferase activity in the 
untreated endodermal cells (control [CTRL]) and is 
consistent with the mild induction of the TOPFLASH 
reporter observed upon exposure to Wnt3a-soluble pro- 
tein (Fig. 4G). 



Next, we conducted biochemical assays on endoderm 
explants treated with Wnt5a and Wnt3a to analyze the 
status of activation of p-catenin signaling. Western blot 
analysis showed that levels of total and active p-catenin 
(ABC) protein (dephosphorylated on Ser37 and Thr41) 
(van Noort et al. 2002) remained unchanged in Wnt5a- 
treated anterior endodermal cells when compared with 
control endoderm (Fig. 4H), whereas Wnt3a moderately 
increased dephosphorylated p-catenin levels. As ex- 
pected, Wnt5a induced JNK phosphorylation, reflecting 
noncanonical Wnt activation (Fig. 4H). Taken together, 
these results rule out Wnt5a function as an antagonist 
of the canonical Wnt pathway and support a role of the 
noncanonical Wnt signaling in the pancreatic versus 
hepatic fate decision in a p-catenin-independent manner. 

Last, we sought to expand the functional analysis on 
the noncanonical Wnt pathway directly to mammalian 
endodermal cells. First, we used the mouse embryonic 
stem cell (mESC) system to model ex vivo endoderm 
development (Rossant 2011). In particular, to induce 
pancreatic specification, mESCs were stimulated using 
a step-wise protocol in a monolayer culture adapted from 
previously published studies (D'Amour et al. 2006; 
Nostro et al. 2011; Wang et al. 2011; Chen et al. 2013). 
RT-qPCR analysis revealed first up-regulation of defini- 
tive endoderm transcription factors such as Soxll and 
Foxa2 followed by sequential induction of pancreatic 
progenitor genes, including Pdxl, Sox9, Pax6, and Islet 1 
[Isll] (Fig. 5A). Consistent with pancreatic differentiation, 
the levels of expression of Sox9, Pax6, and Isll increased 
with time and upon exposure to additional cytokines 
(e.g., noggin, retinoic acid, and cy dopamine), while no or 
minimal induction of the hepatic marker Albumin was 
detected (Fig. 5A). Most differentiation protocols use 
FGF10 to pattern the definitive endoderm-induced pop- 
ulation toward the pancreatic endoderm fate within the 
foregut and the subsequent pancreatic progenitor fate 
(D'Amour et al. 2006; Nostro et al. 2011). To determine 
whether the noncanonical Wnt pathway would enhance 
pancreatic specification, we examined the consequences 
of treating definitive endoderm cultures generated from 
ESCs with Wnt5a in the presence or absence of FGF10. 
Interestingly, Wnt5a enhanced the levels of Pdxl expres- 
sion compared with the standard pancreatic endoderm 
differentiation conditions (Fig. 5B). In addition, the posi- 
tive effect of Wnt5a was even more evident in definitive 
endoderm cells cultured in the absence of FGF10, as its 
addition restored the induction of Pdxl expression to 
pancreatic endoderm standard levels (Fig. 5C). Impor- 
tantly, the duodenal marker Cdx2 was not induced in 
pancreatic endoderm cultures differentiated in the pres- 
ence of Wnt5a, and the slight induction of Albumin 
expression in pancreatic endoderm cultures was down- 
regulated by Wnt5a treatment, supporting a specific ef- 
fect on pancreatic fate (Fig. 5B). In addition, we found that 
other pancreatic progenitor transcription factors, includ- 
ing Nkx6.1 (at the pancreatic endoderm stage), Sox9, and 
Pax6 (at the pancreatic progenitor stage), were strongly 
induced in both the absence and presence of Wnt5a, 
although a further enhancement of their transcript levels 



GENES & DEVELOPMENT 1939 



Rodriguez-Seguel et al. 



A dO 



d2 



d5 



d8 



<L) 8 
?7 

5 6 

s 5 

<u 3 

i - 

06 0 



B 



□ ESC (dO) 

□ d2 



mn-imn 





14, 




□ ESC(dO) 35 1 




i 


12- 




i 


□ d5 30- 






10- 






25- 






8- 






20- 






6- 






15« 






4- 






10- 






2- 

- 0- 






rTl rTl rTl n-i jj. 



□ ESC (dO) 

□ d8 



U) rM 

O X <V 

C O x 

-5 * £ 



• — ON VO ' — 



— < 




100 



90 



80 



70 



a, 60 

CD 

| 50 
5 40 



0; 

1 2 



□ ESC (dO) 

□ PE (d5) 

■ PE(d5) + Wnt5a 



100 
90 
80 
70 



fl 



ri 



** 

** 

t 



□ ESC (dO) 

□ PP (d8) 

■ PP(d8) + Wnt5a 



□ PE(d5) 

□ PE-FGF10 
■ PE + Wnt5a 

□ PE-FGF10 + Wnt5a 



Pdx1 Cdx2 Nkx6.lHNF!3 Alb 




Pdxl 



12 



c 

5 3 
u 

<D 2 
> 

oc l 



I □ untreated liver cells 

I ** ■ Wnt5a-liver cells 

H 

nrlrlrlnrinninni 



Sox9 Pdx1 Pax6 MafA Hes1 Alb HNF4a1 A1AT Proxl Hex 

Figure 5. Conserved Wnt5a activity in promoting pancreatic fate. [A) Directed differentiation of mESC monolayer cultures into 
pancreatic progenitors. RT-qPCR analysis evaluating definitive endoderm (DE), pancreatic endoderm (PE), and pancreatic progenitor 
(PP) gene expression at different stages of differentiation. Untreated mESCs were used as control (dO). Data were normalized to that of 
SDHA and are represented as fold changes compared with control (dO) mESCs (set to 1 as calibrator). (5) Day 5 and day 8 mESC cultures 
were analyzed by RT-qPCR for the expression of the indicated genes following either standard pancreatic endoderm and pancreatic 
progenitor culture conditions or in the presence of Wnt5a recombinant protein (PE + Wnt5a and PP + Wnt5a). RT-qPCR data were 
normalized to that of SDHA and are represented as fold changes compared with control (dO) mESCs (set to 1 as calibrator). Error bars 
represent ±SEM. (C) Day 5 mESC cultures were analyzed by RT-qPCR for the expression of the pancreatic gene Pdxl following 
standard pancreatic endoderm culture conditions in the absence of FGF10 (PE — FGF10), the presence of Wnt5a (PE + Wnt5a), or the 
presence of Wnt5a but without FGF10 (PE - FGF10 + Wnt5a). RT-qPCR data were normalized to that of SDHA and are represented as 
fold changes compared with standard pancreatic endoderm condition (set to 1 as calibrator). Error bars represent ±SEM. [D] BAML liver 
cells cultured in the presence of 200 ng/mL Wnt5a for 2 wk were assayed for expression of the indicated pancreatic and hepatic genes by 
RT-qPCR analysis. Untreated BMAL liver cells were used as control. Data were normalized to that of SDHA and are represented as fold 
changes compared with untreated liver cells (set to 1 as calibrator). Error bars represent ±SEM. (*) P < 0.05; (**) P < 0.01, as determined 
by the REST program statistical analysis (Pfaffl et al. 2002). (Alb) Albumin. 
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(as seen for Pdxl) was not detectable in these culture 
conditions (Fig. 5B). This finding suggests that patterning 
of anterior endoderm with noncanonical Wnt signaling 
might be important for optimal pancreatic specification 
in protocols for directed differentiation of ESCs. 

Next, we asked whether modulation of the noncanon- 
ical Wnt pathway might promote the pancreatic program 
in differentiated liver cells. To this aim, we used a mam- 
malian ex vivo model system, the nontransformed BAML 
hepatic cell line (Fougere-Deschatrette et al. 2006). The 
BAML cells were established from adult mouse livers and 
were shown previously to retain hepatic differentiation 
hallmarks and repopulate the liver in vivo (Fougere- 
Deschatrette et al. 2006). We cultured the liver cells in 
their standard hepatocyte culture medium in the pres- 
ence or absence of Wnt5a. After prolonged exposure to 
Wnt5a, we observed significant induction of the pancre- 
atic genes, including Pdxl, Pax6, and MafA, in liver cells, 
whereas the level of expression of the liver markers 
Albumin and al -antitrypsin [A1AT) and the liver-specific 
transcription factor HNF4al was not affected (Fig. 5C). 
This suggests that Wnt5a might also facilitate fate con- 
version of liver cells into pancreatic fate but does not 
itself possess the ability to suppress hepatic identity. 

All together, these results implicate a novel and con- 
served role for the noncanonical Wnt signaling pathway 
in promoting the pancreas versus liver fate decision in the 
endoderm. In particular, Wnt5a appears to exert specific 
regulation on Pdxl gene expression, which is conserved 
in both the mouse and Xenopus endoderm (Figs. 4, 5). An 
entire set of noncanonical Wnt transducers was identified 
in our analysis (Figs. 2, 3); whether Wnt5a and Wnt5b 
activities in this cell lineage decision are due to signaling 
through components of the PCP, Ror2/JNK, or Calcium/ 
NFAT pathways (Kim et al. 2005; Ober et al. 2006; Li 
et al. 2008; Puri and Hebrok 2010) deserves further 
investigation. 

Spatially distinct expression patterns mark 
the progenitor populations 

Our data provide an opportunity to systematically exam- 
ine additional in vivo expression patterns within the 
hepatic and pancreatic endoderm. For example, we fo- 
cused on identifying transcriptional differences between 
discrete domains of the foregut endoderm at E8.5 (fg vs. 
mfg), before organ rudiments were formed (Fig. 6). Only 
21 transcripts displayed statistically significant differen- 
tial expression between these two regions (Fig. 6A). 
The small number of differentially expressed transcripts 
might reflect the active movements and intermingling of 
cells across the foregut at this stage (Tremblay and Zaret 
2005). Among them were only a few genes that had been 
previously reported in the endoderm, including Pyy, Otx2, 
and Sox2 (Hou et al. 2007), but the majority was not 
known to be expressed in this territory (Fig. 6A-D). The 
expression of some of these foregut genes was maintained 
in either liver progenitors (E2f2), ventral pancreatic pro- 
genitors (Celsr2 and Nrlh5), or both (Cdknlc), represent- 
ing interesting candidates for lineage tracing studies to 



establish the exact contribution of progenitor domains 
within the endoderm. 

Notably, in the ventral foregut epithelium, we detected 
Hox genes encoded by the HoxD clusters (Hoxd3 and 
Hoxd4) as well as a spliced noncoding RNA that is 
transcribed antisense to the Hoxb5 and Hoxb6 genes 
(0610040B09Rik), which might be relevant for their 
regulation (Fig. 6A ; Supplemental Fig. 6; Dinger et al. 
2008). A clear spatial difference in the expression of Hox 
genes was evident also at later stages in the pairwise 
comparison between El 0.5 vpa and dpa data sets (Supple- 
mental Fig. 7). Hox genes such as Hoxa3, Hoxa4, Hoxal , 
Hoxb3, Hoxb6, Hoxbl , and Hoxb8 were abundantly 
expressed in the dorsal pancreatic region but were absent 
or at undetectable levels in the vpa and liver (Fig. 6A; 
Supplemental Fig. 7). Using RT-qPCR and in situ hybrid- 
ization, we confirmed the differential expression between 
the dpa and vpa of selected Hox genes, including Hoxal , 
Hoxb6, and Hoxb8 (Fig. 6F-I). Notably, Hoxal and Hoxb6 
seem to be expressed in a heterogenous manner in the 
dpa, possibly marking specific cell types (Fig. 6F,G). 
Taken together, our findings suggest that a specific Hox 
code exists in the developing foregut as well as within the 
pancreatic territory, and its biological significance needs 
to be assessed (Grapin-Botton and Melton 2000; Iimura 
et al. 2009). 

Besides the Hox genes, we detected large differences in 
gene expression between the ventral and dorsal pancreatic 
buds at the onset of pancreas organogenesis (192 tran- 
scripts,- absolute value of log2 fold change > 3, P-value < 
0.05) (Supplemental Fig. 7). To determine whether there 
was any recognizable biological relevance to the expression 
patterns, we analyzed GO terms with respect to molecular 
functions and found significantly enriched annotations in 
each cluster. Strong association with extracellular bind- 
ing activities (GO: 0005488; 62 genes,- P-value < 0.000129) 
was found in the vpa cluster, whereas the dpa cluster 
showed strong enrichment for DNA-binding transcrip- 
tion factor function (GO: 0003700; 25 genes,- P- value < 
9.41X10" 18 ). 

Our comprehensive analysis further illustrates how 
different the dorsal and ventral pancreatic progenitors 
are from each other. Despite these differences at El 0.5, 
both pancreatic rudiments give rise to endocrine and 
exocrine cells at later stages and eventually fuse to form 
the definitive pancreas (Puri and Hebrok 2010). Our data 
set will enable a clear understanding of how distinct 
transcriptional programs might lead to the same cell type, 
allowing, for instance, the identification of alternative 
modes that can be used to program stem cells toward 
pancreatic endocrine fates. 

Concordant gene signatures between mouse 
and human pancreatic progenitors 

Next, we sought to evaluate the evolutionary impor- 
tance of our transcriptome analysis of pancreatic pro- 
genitor cells and its relevance to human pancreas de- 
velopment. Given the fact that genomic analysis on 
equivalent developmental stage human material is not 
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Figure 6. Identification of distinct spatial patterns within the mouse foregut endoderm. [A] Heat map view of the transcripts that 
showed significant differential expression between E8.5 fg and mfg (21 transcripts). Colors represent high (red), low (blue), or average 
(white) expression values based on Z-score-normalized FPKM values for each gene. (B) RT-qPCR validation of a subset of the foregut 
differentially expressed genes. The XLOC_0 19271 is not supported by any spliced ESTs or Genscan predictions. By exon junction 
analysis, we predicted a gene model and validated the expression of this novel transcript in the fg and dpa by RT-qPCR (see also Fig. 3B ; 
Supplemental Fig. 8). Data were normalized to that of SDH A and are represented as fold change compared with the E8.5 fg sample (set 
to 1 as calibrator). Error bars represent ±SEM. [C-H) Immunofluorescence and in situ hybridization analyses validated the expression of 
the indicated genes in the E8.5 foregut endoderm (see arrowheads) and/or dorsal pancreatic rudiments (demarcated by dashed line). At 
E8.5, Otx2 expression was detected in the mfg (C), which coexpressed Soxl7 and E-cadherin (Ecad), as shown by immunofluorescence 
staining on serial section (C). FoxD3 expression was detected in the E8.5 fg endoderm [D; see arrowheads) and dpa cells at E10.5 (£). 
Arrows in E indicate FoxD3/Pdxl -double-positive cells. At E10.5, Hox gene expression was detected in the dorsal pancreatic rudiment 
(F-H), which coexpressed Pdxl and E-cadherin (Ecad), as shown by immunofluorescence staining on serial section (F). Bars, 50 |jim. (7) 
RT-qPCR validation of the indicated Hox genes showed differential expression between the vpa and dpa. Data were normalized to that 
of SDHA and are shown as expression ratio (2-log values) of dpa sample versus vpa sample. Error bars represent ±SEM. (/) The heat map 
shows the expression of 100 mouse genes found to be expressed in both the vpa and dpa (>10 FPKM) and whose human orthologs are 
also expressed in human pancreatic "progenitor-like" cells (Micallef et al. 2012). 



1942 GENES & DEVELOPMENT 



RNA-seq of hepatic and pancreatic progenitor cells 



available, we compared our murine RNA-seq data sets 
with available microarray data obtained from human 
ESC (hESC (-derived pancreatic-like cells (Micallef et al. 
2012). 

Human pancreatic progenitor state was defined by the 
top 400 up-regulated genes in hESC-derived pancreatic 
cells versus undifferentiated hESCs (Micallef et al. 2012). 
Importantly, we restricted our analysis only to those 
genes that were up-regulated in hESCs differentiated into 
a specific pancreatic differentiation stage that precedes 
the onset of overt (3-cell differentiation, representing 
the closest gene signature of human pancreatic progeni- 
tors. This corresponds to the fraction of "FACS-sorted 
insulin-GFP-negative cells" obtained after application of 
published differentiation protocols to undifferentiated 
INS GFP/w hESCs (Micallef et al. 2012). Mouse genes from 
the RNA-seq experiments were matched to their corre- 
sponding human homologs using the NCBI database 
HomoloGene (http://www.ncbi.nlm.nih.gov/homologene,- 
see also the Materials and Methods). Importantly, we 
found that that 25% of the 400 human homolog genes 
are also markers of pancreas progenitors in mice, display- 
ing high gene expression in the El 0.5 vpa and dpa data 
sets despite the methodological and temporal differ- 
ences (Fig. 6J). 

Discussion 

In conclusion, we devised a strategy to purify liver and 
pancreas progenitors directly from the endoderm of 
Tg(Proxl-EGFP) reporter mouse embryos. Using RNA- 
seq, we profiled the purified progenitor cells and iden- 
tified transcripts that are differentially expressed across 
the pancreatic and hepatic endoderm at distinct develop- 
mental stages. Importantly, we uncovered a unique non- 
canonical Wnt signaling signature in the emergence of 
pancreas versus liver from endoderm progenitors that 
appears conserved across phylogenetically distant spe- 
cies. Our results provide an invaluable resource for 
lineage tracing analysis to pinpoint the exact origin of 
the hepato-pancreatic lineage and isolation of transient 
progenitor cell populations. Moreover, our transcrip- 
tional data lay the foundation for further targeted func- 
tional studies of developmental regulators of the liver and 
pancreas fate decision that will be relevant to humans. 
Indeed, when we compared our RNA-seq results with 
those previously obtained using microarray in hESCs 
differentiated into pancreatic-like progenitors (Micallef 
et al. 2012), we identified concordant gene signatures 
(e.g., similar markers of pancreatic progenitors: Wis, 
FLRTs, and Meis). These results suggest direct impli- 
cations of our findings in developing novel strategies to 
generate pancreas progenitors and (3 cells for clinical 
transplantation from cellular programming of stem 
cells or induced pluripotent stem cells (iPSCs) and from 
reprogramming of adult hepatic cells. Finally, our 
RNA-seq analysis might be used as an in vivo reference 
for direct comparison with diseased human tissues 
such as pancreas dysplasia or agenesis or congenital liver 
defects. 



Materials and methods 

Mouse embryo dissection and FACS sorting 

The transgenic mouse line Tg(Proxl-EGFP) Gsat/Mmcd was ob- 
tained from the Heintz Laboratory-Gensat Project and generated 
as previously described (Gong et al. 2003). E8.5 and E10.5 Proxl- 
EGFP-positive embryos were selected and dissected using an 
epifluorescence stereomicroscope (Discovery VI 2, Zeiss). Eye- 
brow knives or tweezers were used for cutting the GFP-positive 
embryonic regions. The dissected embryonic regions were disso- 
ciated by trypsin (trypsin/EDTA 0.25% solution) digestion into 
single-cell suspension. To stop the reaction, DMEM (Invitrogen) 
was added to the cell suspensions and centrifuged at 300g at 4°C, 
and pellets were suspended and PBS/DEPC-treated. Before sort- 
ing, propidium iodide was added to select only live cells, and the 
cell suspension was filtered through a BD Falcon tube with cell 
strainer cap (BD 352235). After dead cell exclusion (SSC-A/PI-A), 
GFP-expressing cells were sorted at 4°C using a FACS Aria III 
flow cytometer (BD Biosciences) using a GFP filter and by setting 
the gate on the GFP fluorescence intensity. Conditions of sorting 
were as follows: 70-|xm nozzle and sheath pressure of 70 psi. 
Sorted cells were collected directly in TRIzol reagent (Invitrogen) 
for RNA extraction. Cells isolated by FACS were used for RNA 
isolation and RNA-seq and subsequently for RT-qPCR valida- 
tion. Approximately, a total of 10,000 cells per sample were 
pooled to isolate 100 ng of high-quality total RNA. All animal 
experimentation was conducted in accordance with the local 
ethics committee for animal care. 



RNA-seq 

RNA-seq was performed using —70 ng of total RNA quantified 
by Agilent RNA 6000 Pico kit (Agilent Technologies). The qual- 
ity of RNA samples prior to library preparation was determined 
using an Agilent Bioanalzyer, and only samples with high RIN 
(RNA integrity number) scores (>8.5) were further processed. 
Briefly, poly(A) RNA was isolated by two rounds of oligo(dT)25 
Dynabeads (Invitrogen) purification. Purified poly(A) RNA was 
fragmented for 3.5 min at 94°C using 5X fragmentation buffer 
(200 mM Tris-acetate at pH 8.1, 500 mM KOAc, 150 mM MgOA) 
as described previously (Adamidi et al. 2011). The fragmented 
RNA was purified by Agencourt RNAClean XP SPRI beads 
(Agencourt) and converted to first strand cDNA using random 
hexamer primers (Invitrogen) and Superscript II reverse tran- 
scriptase (Invitrogen) followed by second strand cDNA synthe- 
sis with Escherichia coli DNA polymerase I (Invitrogen) and 
RNase H (Invitrogen) according to the manufacturer's intstruc- 
tions. The 76-nucleotide (nt) paired-end sequencing library was 
prepared using New England Biolabs Next DNA Library prep- 
aration kit following the Illumina mRNA-Seq library prepara- 
tion protocol. The minimum necessary number of cycles of 
amplification was used to minimize amplification biases (15 
cycles with no overamplification observed). The quality of cDNA 
libraries for sequencing was assessed using Agilent Technologies 
2100 Bioanalyzer. The prepared sequencing library was subse- 
quently sequenced on Illumina HiSeq for 2 X 100 cycles following 
the standard protocol (Adamidi et al. 201 1). 

Bioinformatics analysis 

Mapped reads from TopHat (version 1.33) were used as input for 
Cufflinks (version 1.3.0) (Trapnell et al. 2010) for transcript 
assembly and differential expression using the University of 
California at Santa Cruz mm9 reference annotation. To obtain 
a working data set for the purposes of comparing transcriptomes 
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between different samples, we filtered the Cufflinks gene set for 
(1) successful deconvolution of fragments per kilobase of exon 
per million mapped fragments (FPKM), (2) FPKM > 0 in at least 
two samples, and (3) lower confidence level of FPKM > 0 in at 
least two samples. In the case that multiple Cufflinks genes for 
one gene symbol existed, Cufflinks genes were further selected 
for greatest transcript length and highest variability in the FPKM 
across all samples. This resulted in a nonredundant gene set of 
14,053 Cufflinks genes. 

Unless otherwise noted, bioinformatic analysis was carried 
out using R statistical environment (version >2.15; http:// 
www.r-project.org). For the PCA of the nonredundant working 
data set, a ceiling FPKM value was set as the 90th percentile of 
all expression values in the data set. The first two principal 
components exhibited 48% and 35% of the total variability for 
PCI and PC2, respectively. The Venn diagram was generated 
using the R packages limma (Ritchie et al. 2007) and VennDiagram 
(Adamidi et al. 2011). Enriched GO categories were calculated 
using the Bioconductor package TopGO (Alexa et al. 2006). In each 
case, the enrichment of terms in the group FP was compared with 
all genes in the working data set with annotations in their re- 
spective ontologies. 

To demonstrate relevance with human pancreatic develop- 
ment, we obtained normalized microarray data from a previously 
published study of hESC differentiation to pancreatic cells 
(Micallef et al. 2012). Differential gene expression (at a false dis- 
covery rate [FDR] adjusted P- value < 0.05) between the differen- 
tiated pancreas-like cells ("Nostro protocol 77 ) and hESC lines was 
determined using the R statistical package limma (Smyth 2005). 
Mouse genes from the RNA-seq experiments were matched to 
their corresponding human homologs using the NCBI database 
HomoloGene (http://www.ncbi.nlm.nih.gov/homologene). The 
RNA-seq data sets have been deposited in NCBI's Gene Expres- 
sion Omnibus (GEO) and are accessible through accession number 
GSE40823. 

Immunohistochemistry and in situ hybridization 

Mouse embryos were fixed in 4% paraformaldehyde from 2 h to 
overnight at 4°C. Subsequently, samples were equilibrated in 
20% sucrose solution and embedded in OCT compound (Sakura). 
In situ hybridization on cryostat sections was done as in 
Schaeren-Wiemers and Gerfin-Moser (1993). Cryosections (10 
|xm) were incubated with TSA (Perkin Elmer) blocking buffer for 
1 h at room temperature and afterward with primary antibodies 
at an appropriate dilution (Supplemental Table 4). All confocal 
images were acquired with an LSM 700 confocal laser-scanning 
microscope (Zeiss). 

RT-qPCR 

For RNA isolation, embryonic tissues were microdissected, 
FACS-sorted, and collected in TRIzol reagent (Invitrogen). Sub- 
sequently, RNA was extracted from a minimum of 3000 cells 
according to the manufacturer's instructions (Roche). Briefly, 
total RNA was resuspended in 10 juuL of DEPC H 2 0 and processed 
for reverse transcription using Transcriptor First Strand cDNA 
Synthesis kit (Roche). A mix of anchored oligo(dT)18 and random 
hexamer primers was used to generate the cDNA. Thermolabile 
nuclease from the Real-Time Ready Cell Lysis kit (Roche) was 
added to the reverse transcription reaction to degrade dsDNA. 
Real-time PCR reactions were carried out using the SYBR Green 
Master mix (Roche) on the ABI StepOne Plus system. Succinate 
dehydrogenase [SDH A) was used as reference gene. All of the 
values were normalized to the reference gene and calculated 
using the REST program (Pfaffl et al. 2002). Data were deter- 



mined in triplets. All experiments were repeated at least three 
times unless otherwise stated. Primer sequences can be obtained 
on request. 

Xenopus embryo experiments 

Xenopus embryo manipulations and dissections were performed 
as described (Spagnoli and Brivanlou 2008). The SP64T-xWnt5a 
plasmid used for microinjection was previously described (Moon 
et al. 1993). For whole endodermal explant experiments, endo- 
dermal cells were microdissected at stage 9 and cultured in 0.1 X 
MMR in the presence of the indicated recombinant proteins 
until the desired stage. For anterior endodermal explants, dorsal 
vegetal cells were microdissected at stage 10 and cultured as 
above. Ornithine decarboxylase [ODC] was used as a reference 
gene in Xenopus RT-qPCR analysis. All of the values were 
normalized to the reference gene and calculated using the REST 
program (Pfaffl et al. 2002). For lucif erase reporter assays in 
Xenopus, embryos were injected with ATF2-luc (Ohkawara and 
Niehrs 2011) or TOPFLASH and Renilla-TK plasmid DNA 
(Clevers and Nusse 2012). Three pools of 15 explants each were 
lysed with passive lysis buffer and assayed for luciferase activity 
using the Dual Luciferase system (Promega). Recombinant mouse 
Wnt5a, Wnt5b, and Wnt3a (R&D System) proteins were used at 
the indicated concentrations in both Xenopus and cell culture 
experiments. All experiments were repeated at least three times 
unless otherwise stated. 

Cell culture experiments 

mESCs (Rl mESC line) were maintained on gelatin-coated plates 
with mouse embryonic fibroblasts (MEFs) in standard mESC 
medium: DMEM (Invitrogen), 2 mM glutamax (Invitrogen), 
1 mM sodium pyruvate (Invitrogen), 0.1 mM nonessential amino 
acids (Invitrogen), 15% fetal bovine serum (FBS) (PAN Biotech), 
0.1 mM (3-mercaptoethanol (Sigma), and 1000 U/mL leukemia 
inhibitory factor (ESGRO). For differentiation, cultures were 
MEF-depleted and seeded in mESC medium at high confluency 
on gelatin-coated dishes. Monolayer differentiation was carried 
out as described previously (D'Amour et al. 2006; Nostro et al. 
201 1). Briefly, definitive endoderm medium to day 2 consisted of 
RPMI medium (Invitrogen) and 0.2% FBS supplemented with 50 
ng/mL Activin A and 25 ng/mL Wnt3a at day 1 and Activin A at 
day 2. Pancreatic endoderm medium to day 5 consisted of RPMI 
medium and 2% FBS supplemented with 3 ng/mL Wnt3a and 
50 ng/mL FGF10. Pancreatic progenitor medium to day 8 consisted 
of DMEM with 1 % (v/v) B27 supplement (Invitrogen), 50 |xg/mL 
ascorbic acid (Sigma), 0.25 |jlM KAAD-cy dopamine (Toronto Re- 
search Chemicals), 2 |jlM all-trans retinoic acid (Sigma), and 
50 ng/mL noggin. All recombinant proteins were purchased from 
R&D System unless otherwise stated. The BAML cells were 
cultured as previously described (Fougere-Deschatrette et al. 2006). 

Western blot analysis 

For Western blot analysis, endodermal cells were lysed as de- 
scribed (Spagnoli and Brivanlou 2008). Immunoblots were in- 
cubated with anti-ABC monoclonal antibody, which recognizes 
the dephosphorylated (3-catenin on Ser37 and Thr41 (Millipore); 
anti-p-catenin polyclonal antibody (Santa Cruz Biotechnology); 
anti-pJNK monoclonal antibody (Santa Cruz Biotechnology); 
and anti-a-tubulin monoclonal antibody (Sigma) and analyzed 
using the LI-COR Odyssey system. 

Statistical tests 

All results are expressed as mean ± standard errors. The 
significance of differences between groups was evaluated with 



1944 GENES & DEVELOPMENT 



RNA-seq of hepatic and pancreatic progenitor cells 



Student's t-test. P < 0.05 was considered statistically significant. 
Statistical tests relevant to the RNA-seq data set analysis are 
described in the "Bioinformatic Analysis 77 section. 
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